首页> 外文OA文献 >Fast Bayesian Feature Selection for High Dimensional Linear Regression in Genomics via the Ising Approximation
【2h】

Fast Bayesian Feature Selection for High Dimensional Linear Regression in Genomics via the Ising Approximation

机译:高维线性回归的快速贝叶斯特征选择   在基因组学中通过伊辛近似

摘要

Feature selection, identifying a subset of variables that are relevant forpredicting a response, is an important and challenging component of manymethods in statistics and machine learning. Feature selection is especiallydifficult and computationally intensive when the number of variables approachesor exceeds the number of samples, as is often the case for many genomicdatasets. Here, we introduce a new approach -- the Bayesian Ising Approximation(BIA) -- to rapidly calculate posterior probabilities for feature relevance inL2 penalized linear regression. In the regime where the regression problem isstrongly regularized by the prior, we show that computing the marginalposterior probabilities for features is equivalent to computing themagnetizations of an Ising model. Using a mean field approximation, we show itis possible to rapidly compute the feature selection path described by theposterior probabilities as a function of the L2 penalty. We present simulationsand analytical results illustrating the accuracy of the BIA on some simpleregression problems. Finally, we demonstrate the applicability of the BIA tohigh dimensional regression by analyzing a gene expression dataset with nearly30,000 features.
机译:特征选择(确定与预测响应相关的变量子集)是统计和机器学习中许多方法的重要且具有挑战性的组成部分。当变量的数量接近或超过样本数量时,特征选择尤其困难且计算量大,这对于许多基因组数据集通常是这样。在这里,我们介绍一种新的方法-贝叶斯伊辛近似(BIA)-快速计算L2惩罚线性回归中与特征相关的后验概率。在先验强烈地解决了回归问题的情况下,我们表明,计算特征的边缘后验概率等效于计算伊辛模型的磁化强度。使用平均场近似,我们表明有可能快速计算由后验概率描述的特征选择路径作为L2惩罚的函数。我们提供的模拟和分析结果说明了BIA在一些简单回归问题上的准确性。最后,我们通过分析具有近30,000个特征的基因表达数据集,证明了BIA对高维回归的适用性。

著录项

  • 作者单位
  • 年度 2014
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号